Optimal Sample Size for Testing Multilocus Genotypic Effects by a Bayesian Method Using a Gibbs Sampler
نویسندگان
چکیده
The potential epistasis that may explain a large portion of the phenotypic variation for complex traits has been ignored in many genetic association studies. A Baysian method using a Gibbs sampler was introduced to draw inferences about multilocus genotypic effects based on their marginal posterior distributions by a Gibbs sampler. This method would be applied to studies for interaction effects among limited number of loci although theoretically they are applicable to all the possible interaction of millions of single nucleotide variants resulted from genomewide association study. A simulation study was conducted to provide an optimal sample size for experimental designs with this method. Data were simulated with more than 42,240 data sets produced by combined designs of number of loci (2, 3, 4, and 5 loci), within genotype variance (10 ~ 40, 16 levels), and sample size (5 ~ 100, 20 levels) in unbalanced designs with various portions of null genotypic cells (0 ~ 50%, 11 levels). Mean empirical statistical power was estimated for each data set in testing the posterior mean estimate of combination genotypic effect. Additionally, mean square prediction error was obtained from estimating the posterior mean estimate. The optimal sample sizes were provided with the prediction error > 2.0 and the statistical power >0.8 under various designs. The Baysian method using a Gibbs sampler was suggested for testing and estimating epistatic effects among limited number (2~4) of loci. Practical guidelines for determining the optimal sample size with a specific power are provided when population geneticists apply the Baysian method to their genetic association studies.
منابع مشابه
Empirical Statistical Power for Testing Multilocus Genotypic Effects under Unbalanced Designs Using a Gibbs Sampler
Epistasis that may explain a large portion of the phenotypic variation for complex economic traits of animals has been ignored in many genetic association studies. A Baysian method was introduced to draw inferences about multilocus genotypic effects based on their marginal posterior distributions by a Gibbs sampler. A simulation study was conducted to provide statistical powers under various un...
متن کاملBayesian Inference on Multilocus Genotypic Effects Using a Gibbs Sampler
Simultaneous analysis of multiple genetic variants is an essential strategy for understanding genetic dissection of complex traits, focusing epistasis along with additive and dominance effects of individual genes. Although phenotypic variation for complex traits might be largely explained by epistasis, most analyses have excluded the possibility of epistasis, especially with lack of individual ...
متن کاملBlock Gibbs sampling for Bayesian random effects models with improper priors: Convergence and regeneration
Bayesian versions of the classical one-way random effects model are widely used to analyze data. If the standard diffuse prior is adopted, there is a simple block Gibbs sampler that can be employed to explore the intractable posterior distribution. In this paper, theoretical and methodological results are developed that allow one to use this block Gibbs sampler with the same level of confidence...
متن کاملBayesian Sample Size Determination with Lowest Cost by Using Numerical Methods
In this article, the method of determining the optimal sample size is based on Linex asymmetric loss function and has been expressed through Bayesian method for normal, Poisson and exponential distributions. The desirable sample size has been calculated through numerical method. In numerical method, the average posterior risk is calculated and then it is added to the Lindley linear ...
متن کاملThe Analysis of Bayesian Probit Regression of Binary and Polychotomous Response Data
The goal of this study is to introduce a statistical method regarding the analysis of specific latent data for regression analysis of the discrete data and to build a relation between a probit regression model (related to the discrete response) and normal linear regression model (related to the latent data of continuous response). This method provides precise inferences on binary and multinomia...
متن کامل